LCR-Net++: Multi-person 2D and 3D Pose Detection in Natural Images

نویسندگان

Gregory Rogez

Philippe Weinzaepfel

Cordelia Schmid

چکیده

We propose an end-to-end architecture for joint 2D and 3D human pose estimation in natural images. Key to our approach is the generation and scoring of a number of pose proposals per image, which allows us to predict 2D and 3D poses of multiple people simultaneously. Hence, our approach does not require an approximate localization of the humans for initialization. Our Localization-Classification-Regression architecture, named LCR-Net, contains 3 main components: 1) the pose proposal generator that suggests candidate poses at different locations in the image; 2) a classifier that scores the different pose proposals; and 3) a regressor that refines pose proposals both in 2D and 3D. All three stages share the convolutional feature layers and are trained jointly. The final pose estimation is obtained by integrating over neighboring pose hypotheses, which is shown to improve over a standard non maximum suppression algorithm. Our method recovers full-body 2D and 3D poses, hallucinating plausible body parts when the persons are partially occluded or truncated by the image boundary. Our approach significantly outperforms the state of the art in 3D pose estimation on Human3.6M, a controlled environment. Moreover, it shows promising results on real images for both single and multi-person subsets of the MPII 2D pose benchmark and demonstrates satisfying 3D pose results even for multi-person images.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Single-Shot Multi-Person 3D Body Pose Estimation From Monocular RGB Input

We propose a new efficient single-shot method for multiperson 3D pose estimation in general scenes from a monocular RGB camera. Our fully convolutional DNN-based approach jointly infers 2D and 3D joint locations on the basis of an extended 3D location map supported by body part associations. This new formulation enables the readout of full body poses at a subset of visible joints without the ne...

متن کامل

Hybridization of Facial Features and Use of Multi Modal Information for 3D Face Recognition

Despite of achieving good performance in controlled environment, the conventional 3D face recognition systems still encounter problems in handling the large variations in lighting conditions, facial expression and head pose The humans use the hybrid approach to recognize faces and therefore in this proposed method the human face recognition ability is incorporated by combining global and local ...

متن کامل

Generative 2D and 3D Human Pose Estimation with Vote Distributions

We address the problem of 2D and 3D human pose estimation using monocular camera information only. Generative approaches usually consist of two computationally demanding steps. First, different configurations of a complex 3D body model are projected into the image plane. Second, the projected synthetic person images and images of real persons are compared on a feature basis, like silhouettes or...

متن کامل

Towards Accurate Markerless Human Shape and Pose Estimation over Time

We address the problem of accurately estimating human shape, pose, and motion from images and video without markers or special cameras. Existing methods often assume known backgrounds, static cameras, and sequence specific motion priors. Here we propose a method that is fully automatic and, given multi-view video, estimates 3D human motion and body shape. Our work is built upon the recent SMPLi...

متن کامل

Towards Accurate Markerless Human Shape and Pose Estimation over Time

Existing markerless motion capture methods often assume known backgrounds, static cameras, and sequence specific motion priors, limiting their application scenarios. Here we present a fully automatic method that, given multi-view videos, estimates 3D human pose and body shape. We take the recently proposed SMPLify method [12] as the base method and extend it in several ways. First we fit a 3D h...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2018

LCR-Net++: Multi-person 2D and 3D Pose Detection in Natural Images

نویسندگان

چکیده

منابع مشابه

Single-Shot Multi-Person 3D Body Pose Estimation From Monocular RGB Input

Hybridization of Facial Features and Use of Multi Modal Information for 3D Face Recognition

Generative 2D and 3D Human Pose Estimation with Vote Distributions

Towards Accurate Markerless Human Shape and Pose Estimation over Time

Towards Accurate Markerless Human Shape and Pose Estimation over Time

عنوان ژورنال:

اشتراک گذاری